Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
1.
Int J Mol Sci ; 25(7)2024 Mar 26.
Article in English | MEDLINE | ID: mdl-38612505

ABSTRACT

SARS-CoV-2 has accumulated many mutations since its emergence in late 2019. Nucleotide substitutions leading to amino acid replacements constitute the primary material for natural selection. Insertions, deletions, and substitutions appear to be critical for coronavirus's macro- and microevolution. Understanding the molecular mechanisms of mutations in the mutational hotspots (positions, loci with recurrent mutations, and nucleotide context) is important for disentangling roles of mutagenesis and selection. In the SARS-CoV-2 genome, deletions and insertions are frequently associated with repetitive sequences, whereas C>U substitutions are often surrounded by nucleotides resembling the APOBEC mutable motifs. We describe various approaches to mutation spectra analyses, including the context features of RNAs that are likely to be involved in the generation of recurrent mutations. We also discuss the interplay between mutations and natural selection as a complex evolutionary trend. The substantial variability and complexity of pipelines for the reconstruction of mutations and the huge number of genomic sequences are major problems for the analyses of mutations in the SARS-CoV-2 genome. As a solution, we advocate for the development of a centralized database of predicted mutations, which needs to be updated on a regular basis.


Subject(s)
COVID-19 , Humans , COVID-19/genetics , SARS-CoV-2/genetics , Mutagenesis , Mutation , Nucleotides
2.
Angew Chem Int Ed Engl ; 63(19): e202400551, 2024 May 06.
Article in English | MEDLINE | ID: mdl-38416545

ABSTRACT

Detecting low-frequency DNA mutations hotspots cluster is critical for cancer diagnosis but remains challenging. Quantitative PCR (qPCR) is constrained by sensitivity, and allele-specific PCR is restricted by throughput. Here we develop a long blocker displacement amplification (LBDA) coupled with qPCR for ultrasensitive and multiplexed variants detection. By designing long blocker oligos to perfectly match wildtype sequences while mispairing with mutants, long blockers enable 14-44 nt enrichment regions which is 2-fold longer than normal BDA in the experiments. For wild template with a specific nucleotide, LBDA can detect different mutation types down to 0.5 % variant allele frequency (VAF) in one reaction, with median enrichment fold of 1,000 on 21 mutant DNA templates compared to the wild type. We applied LBDA-qPCR to detect KRAS and NRAS mutation hotspots, utilizing a single plex assay capable of covering 81 mutations and tested in synthetic templates and colorectal cancer tissue samples. Moreover, the mutation types were verified through Sanger sequencing, demonstrating concordance with results obtained from next generation sequencing. Overall, LBDA-qPCR provides a simple yet ultrasensitive approach for multiplexed detection of low VAF mutations hotspots, presenting a powerful tool for cancer diagnosis and monitoring.


Subject(s)
Mutation , Humans , Proto-Oncogene Proteins p21(ras)/genetics , Colorectal Neoplasms/genetics , Colorectal Neoplasms/diagnosis , Membrane Proteins/genetics , Nucleic Acid Amplification Techniques/methods , GTP Phosphohydrolases/genetics
3.
BMC Plant Biol ; 22(1): 550, 2022 Nov 29.
Article in English | MEDLINE | ID: mdl-36443690

ABSTRACT

BACKGROUND: Saussurea is one of the most species-rich genera in the Cardueae, Asteraceae. There are approximately 40 Saussurea species distributed in Korea, with nearly 40% of them endemics. Infrageneric relationships remain uncertain due to insufficient resolutions and low statistical support. In this study, we sequenced the plastid genomes of five Korean endemic Saussurea (S. albifolia, S. calcicola, S. diamantica, S. grandicapitula, and S. seoulensis), and comparative analyses including two other endemics (S. chabyoungsanica and S. polylepis) were conducted. RESULTS: The plastomes of Korean endemics were highly conserved in gene content, order, and numbers. Exceptionally, S. diamantica had mitochondrial DNA sequences including two tRNAs in SSC region. There were no significant differences of the type and numbers of SSRs among the seven Korean endemics except in S. seoulensis. Nine mutation hotspots with high nucleotide diversity value (Pi > 0.0033) were identified, and phylogenetic analysis suggested that those Korean endemic species most likely evolved several times from diverse lineages within the genus. Moreover, molecular dating estimated that the Korean endemic species diverged since the late Miocene. CONCLUSIONS: This study provides insight into understanding the plastome evolution and evolutionary relationships of highly complex species of Saussurea in Korean peninsula.


Subject(s)
Asteraceae , Genome, Plastid , Saussurea , Saussurea/genetics , Phylogeny , Republic of Korea
4.
Front Microbiol ; 12: 753823, 2021.
Article in English | MEDLINE | ID: mdl-34733263

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of the ongoing coronavirus disease 2019 (COVID-19) pandemic. Understanding the influence of mutations in the SARS-CoV-2 gene on clinical outcomes is critical for treatment and prevention. Here, we analyzed all high-coverage complete SARS-CoV-2 sequences from GISAID database from January 1, 2020, to January 1, 2021, to mine the mutation hotspots associated with clinical outcome and developed a model to predict the clinical outcome in different epidemic strains. Exploring the cause of mutation based on RNA-dependent RNA polymerase (RdRp) and RNA-editing enzyme, mutation was more likely to occur in severe and mild cases than in asymptomatic cases, especially A > G, C > T, and G > A mutations. The mutations associated with asymptomatic outcome were mainly in open reading frame 1ab (ORF1ab) and N genes; especially R6997P and V30L mutations occurred together and were correlated with asymptomatic outcome with high prevalence. D614G, Q57H, and S194L mutations were correlated with mild and severe outcome with high prevalence. Interestingly, the single-nucleotide variant (SNV) frequency was higher with high percentage of nt14408 mutation in RdRp in severe cases. The expression of ADAR and APOBEC was associated with clinical outcome. The model has shown that the asymptomatic percentage has increased over time, while there is high symptomatic percentage in Alpha, Beta, and Gamma. These findings suggest that mutation in the SARS-CoV-2 genome may have a direct association with clinical outcomes and pandemic. Our result and model are helpful to predict the prevalence of epidemic strains and to further study the mechanism of mutation causing severe disease.

5.
Plants (Basel) ; 10(10)2021 Sep 22.
Article in English | MEDLINE | ID: mdl-34685791

ABSTRACT

The genus Hosta, which has a native distribution in temperate East Asia and a number of species ranging from 23 to 40, represents a taxonomically important and ornamentally popular plant. Despite its taxonomic and horticultural importance, the genus Hosta has remained taxonomically challenging owing to insufficient diagnostic features, continuous morphological variation, and the process of hybridization and introgression, making species circumscription and phylogenetic inference difficult. In this study, we sequenced 11 accessions of Hosta plastomes, including members of three geographically defined subgenera, Hosta, Bryocles, and Giboshi, determined the characteristics of plastomes, and inferred their phylogenetic relationships. We found highly conserved plastomes among the three subgenera, identified several mutation hotspots that can be used as barcodes, and revealed the patterns of codon usage bias and RNA editing sites. Five positively selected plastome genes (rbcL, rpoB, rpoC2, rpl16, and rpl20) were identified. Phylogenetic analysis suggested (1) the earliest divergence of subg. Hosta, (2) non-monophyly of subg. Bryocles and its two sections (Lamellatae and Stoloniferae), (3) a sister relationship between H. sieboldiana (subg. Giboshi) and H. ventricosa (subg. Bryocles), and (4) reciprocally monophyletic and divergent lineages of H. capitata in Korea and Japan, requiring further studies of their taxonomic distinction.

6.
Biol Rev Camb Philos Soc ; 96(3): 822-841, 2021 06.
Article in English | MEDLINE | ID: mdl-33615674

ABSTRACT

The separation of germ cell populations from the soma is part of the evolutionary transition to multicellularity. Only genetic information present in the germ cells will be inherited by future generations, and any molecular processes affecting the germline genome are therefore likely to be passed on. Despite its prevalence across taxonomic kingdoms, we are only starting to understand details of the underlying micro-evolutionary processes occurring at the germline genome level. These include segregation, recombination, mutation and selection and can occur at any stage during germline differentiation and mitotic germline proliferation to meiosis and post-meiotic gamete maturation. Selection acting on germ cells at any stage from the diploid germ cell to the haploid gametes may cause significant deviations from Mendelian inheritance and may be more widespread than previously assumed. The mechanisms that affect and potentially alter the genomic sequence and allele frequencies in the germline are pivotal to our understanding of heritability. With the rise of new sequencing technologies, we are now able to address some of these unanswered questions. In this review, we comment on the most recent developments in this field and identify current gaps in our knowledge.


Subject(s)
Germ Cells , Meiosis , Biological Evolution , Genome , Meiosis/genetics , Mutation
7.
Trends Genet ; 37(8): 717-729, 2021 08.
Article in English | MEDLINE | ID: mdl-33199048

ABSTRACT

Mutation of the human genome results in three classes of genomic variation: single nucleotide variants; short insertions or deletions; and large structural variants (SVs). Some mutations occur during normal processes, such as meiotic recombination or B cell development, and others result from DNA replication or aberrant repair of breaks in sequence-specific contexts. Regardless of mechanism, mutations are subject to selection, and some hotspots can manifest in disease. Here, we discuss genomic regions prone to mutation, mechanisms contributing to mutation susceptibility, and the processes leading to their accumulation in normal and somatic genomes. With further, more accurate human genome sequencing, additional mutation hotspots, mechanistic details of their formation, and the relevance of hotspots to evolution and disease are likely to be discovered.


Subject(s)
Genome, Human/genetics , Genomics , Mutation/genetics , DNA Replication/genetics , Genomic Structural Variation/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Recombination, Genetic/genetics
8.
DNA Repair (Amst) ; 90: 102852, 2020 06.
Article in English | MEDLINE | ID: mdl-32388005

ABSTRACT

When its DNA is damaged, Escherichia coli induces the SOS response, which consists of about 40 genes that encode activities to repair or tolerate the damage. Certain alleles of the major SOS-control genes, recA and lexA, cause constitutive expression of the response, resulting in an increase in spontaneous mutations. These mutations, historically called "untargeted", have been the subject of many previous studies. Here we re-examine SOS-induced mutagenesis using mutation accumulation followed by whole-genome sequencing (MA/WGS), which allows a detailed picture of the types of mutations induced as well as their sequence-specificity. Our results confirm previous findings that SOS expression specifically induces transversion base-pair substitutions, with rates averaging about 60-fold above wild-type levels. Surprisingly, the rates of G:C to C:G transversions, normally an extremely rare mutation, were induced an average of 160-fold above wild-type levels. The SOS-induced transversion showed strong sequence specificity, the most extreme of which was the G:C to C:G transversions, 60% of which occurred at the middle base of 5'GGC3'+5'GCC3' sites, although these sites represent only 8% of the G:C base pairs in the genome. SOS-induced transversions were also DNA strand-biased, occurring, on average, 2- to 4- times more often when the purine was on the leading-strand template and the pyrimidine on the lagging-strand template than in the opposite orientation. However, the strand bias was also sequence specific, and even of reverse orientation at some sites. By eliminating constraints on the mutations that can be recovered, the MA/WGS protocol revealed new complexities of SOS "untargeted" mutations.


Subject(s)
Escherichia coli/genetics , Mutagenesis , Mutation , SOS Response, Genetics , DNA, Bacterial/metabolism , DNA-Directed DNA Polymerase/metabolism , Mutation Rate , Whole Genome Sequencing
9.
Int J Mol Sci ; 20(23)2019 Nov 26.
Article in English | MEDLINE | ID: mdl-31779118

ABSTRACT

Species identification of oaks (Quercus) is always a challenge because many species exhibit variable phenotypes that overlap with other species. Oaks are notorious for interspecific hybridization and introgression, and complex speciation patterns involving incomplete lineage sorting. Therefore, accurately identifying Quercus species barcodes has been unsuccessful. In this study, we used chloroplast genome sequence data to identify molecular markers for oak species identification. Using next generation sequencing methods, we sequenced 14 chloroplast genomes of Quercus species in this study and added 10 additional chloroplast genome sequences from GenBank to develop a DNA barcode for oaks. Chloroplast genome sequence divergence was low. We identified four mutation hotspots as candidate Quercus DNA barcodes; two intergenic regions (matK-trnK-rps16 and trnR-atpA) were located in the large single copy region, and two coding regions (ndhF and ycf1b) were located in the small single copy region. The standard plant DNA barcode (rbcL and matK) had lower variability than that of the newly identified markers. Our data provide complete chloroplast genome sequences that improve the phylogenetic resolution and species level discrimination of Quercus. This study demonstrates that the complete chloroplast genome can substantially increase species discriminatory power and resolve phylogenetic relationships in plants.


Subject(s)
Chloroplasts/genetics , DNA Barcoding, Taxonomic/methods , Quercus/classification , Evolution, Molecular , Genetic Markers , Genome, Chloroplast , High-Throughput Nucleotide Sequencing , Mutation , Phylogeny , Quercus/genetics , Sequence Analysis, DNA
10.
Bioessays ; 41(3): e1800152, 2019 03.
Article in English | MEDLINE | ID: mdl-30801747

ABSTRACT

Somatic mutations arising in human skin cancers are heterogeneously distributed across the genome, meaning that certain genomic regions (e.g., heterochromatin or transcription factor binding sites) have much higher mutation densities than others. Regional variations in mutation rates are typically not a consequence of selection, as the vast majority of somatic mutations in skin cancers are passenger mutations that do not promote cell growth or transformation. Instead, variations in DNA repair activity, due to chromatin organization and transcription factor binding, have been proposed to be a primary driver of mutational heterogeneity in melanoma. However, as discussed in this review here, recent studies indicate that chromatin organization and transcription factor binding also significantly modulate the rate at which UV lesions form in DNA. The authors propose that local variations in lesion susceptibility may be an important driver of mutational hotspots in melanoma and other skin cancers, particularly at binding sites for ETS transcription factors.


Subject(s)
DNA Damage/radiation effects , DNA Repair/radiation effects , Melanoma/genetics , Mutation/radiation effects , Skin Neoplasms/genetics , Ultraviolet Rays/adverse effects , Binding Sites/genetics , Humans , Mutagenesis/radiation effects , Mutation Rate , Nucleic Acid Conformation , Nucleosomes/radiation effects , Promoter Regions, Genetic/genetics , Proto-Oncogene Proteins c-ets/metabolism
11.
Clin Chim Acta ; 487: 264-269, 2018 Dec.
Article in English | MEDLINE | ID: mdl-30292630

ABSTRACT

BACKGROUND: Primary hypertrophic osteoarthropathy (PHO) is a genetically and clinically heterogeneous systematic disorder caused by mutations in genes HPGD and SLCO2A1. The purpose of the present study is to provide useful information for the early and precise diagnosis of PHO and identify causative mutations in Chinese PHO children. METHODS AND RESULTS: The clinical manifestations, radiographic features of seven Chinese pediatric patients were systematically analyzed. Targeted exome sequencing identified a previously reported c.310_311delCT mutation and a novel common splicing site mutation c.324 + 5G > A in the HPGD gene. Relative quantitative real time PCR validated a novel deletion of the exon 4 in the same gene. Neither mutations nor structural variations in the gene SLCO2A1 were detected. CONCLUSIONS: In the present study, homozygous or compound heterozygous HPGD mutations were identified in seven Chinese pediatric patients, suggesting an autosomal recessive inheritance. The c.310_311delCT mutation and the splicing site mutation c.324 + 5G > A were likely to be mutational hotspots in Chinese PHO patients. For the first time, a structural variation of the HPGD gene was reported. Homozygous, compound heterozygous mutations or structural variation identified in the HPGD gene proposed that targeted exome sequencing may be a preferable method for pediatric PHO diagnosis and mutation analysis.


Subject(s)
DNA Mutational Analysis , Osteoarthropathy, Primary Hypertrophic/genetics , Child , Child, Preschool , China , Exome , Female , Gene Deletion , Humans , Male , Mutation , Real-Time Polymerase Chain Reaction
12.
Molecules ; 23(7)2018 Jun 26.
Article in English | MEDLINE | ID: mdl-29949900

ABSTRACT

Herbaceous bamboos (Olyreae) are a separate lineage with idiosyncratic traits, e.g., unisexual flowers and annual or seasonal flowering lifestyle, in the grass family. To elucidate the evolution of herbaceous bamboos we produced two complete chloroplast (cp) genomes from two monotypic genera i.e., Froesiochloa and Rehia via the genome-skimming approach. The assembled F. boutelouoides and R. nervata cp genomes were 135,905 and 136,700 base-pair (bp), respectively. Further whole-genome comparative analyses revealed that the cp genes order was perfectly collinear, but the inverted repeats (IRs) borders, i.e., the junctions between IRs and single copy regions, were highly divergent in Olyreae. The IRs expansions/contractions occurred frequently in Olyreae, which have caused gene content and genome size variations, e.g., the copy number reduction of rps19 and trnH(GUG) genes in Froesiochloa. Subsequent nucleotide mutation analyses uncovered a greatly heterogeneous divergence pattern among different cpDNA regions in Olyreae cp genomes. On average, non-coding loci evolved at a rate of circa 1.9 times faster than coding loci, from which 20 rapidly evolving loci were determined as potential genetic markers for further studies on Olyreae. In addition, the phylogenomic analyses from 67 grass plastomes strongly supported the phylogenetic positions of Froesiochloa and Rehia in the Olyreae.


Subject(s)
Genome, Chloroplast , Genome, Plant , Inverted Repeat Sequences/genetics , Mutation/genetics , Poaceae/genetics , Bayes Theorem , Chromosome Mapping , Genetic Loci , Genetic Markers , Likelihood Functions , Molecular Sequence Annotation , Phylogeny , Polymorphism, Single Nucleotide/genetics
13.
Genetics ; 209(4): 1029-1042, 2018 08.
Article in English | MEDLINE | ID: mdl-29907647

ABSTRACT

Mismatch repair (MMR) is a major contributor to replication fidelity, but its impact varies with sequence context and the nature of the mismatch. Mutation accumulation experiments followed by whole-genome sequencing of MMR-defective Escherichia coli strains yielded ≈30,000 base-pair substitutions (BPSs), revealing mutational patterns across the entire chromosome. The BPS spectrum was dominated by A:T to G:C transitions, which occurred predominantly at the center base of 5'NAC3'+5'GTN3' triplets. Surprisingly, growth on minimal medium or at low temperature attenuated these mutations. Mononucleotide runs were also hotspots for BPSs, and the rate at which these occurred increased with run length. Comparison with ≈2000 BPSs accumulated in MMR-proficient strains revealed that both kinds of hotspots appeared in the wild-type spectrum and so are likely to be sites of frequent replication errors. In MMR-defective strains transitions were strand biased, occurring twice as often when A and C rather than T and G were on the lagging-strand template. Loss of nucleotide diphosphate kinase increases the cellular concentration of dCTP, which resulted in increased rates of mutations due to misinsertion of C opposite A and T. In an mmr ndk double mutant strain, these mutations were more frequent when the template A and T were on the leading strand, suggesting that lagging-strand synthesis was more error-prone, or less well corrected by proofreading, than was leading strand synthesis.


Subject(s)
Amino Acid Substitution , DNA Mismatch Repair , Escherichia coli/genetics , Whole Genome Sequencing/methods , DNA Replication , Genome, Bacterial , Point Mutation
14.
Genetics ; 209(4): 1043-1054, 2018 08.
Article in English | MEDLINE | ID: mdl-29907648

ABSTRACT

When the DNA polymerase that replicates the Escherichia coli chromosome, DNA polymerase III, makes an error, there are two primary defenses against mutation: proofreading by the ϵ subunit of the holoenzyme and mismatch repair. In proofreading-deficient strains, mismatch repair is partially saturated and the cell's response to DNA damage, the SOS response, may be partially induced. To investigate the nature of replication errors, we used mutation accumulation experiments and whole-genome sequencing to determine mutation rates and mutational spectra across the entire chromosome of strains deficient in proofreading, mismatch repair, and the SOS response. We report that a proofreading-deficient strain has a mutation rate 4000-fold greater than wild-type strains. While the SOS response may be induced in these cells, it does not contribute to the mutational load. Inactivating mismatch repair in a proofreading-deficient strain increases the mutation rate another 1.5-fold. DNA polymerase has a bias for converting G:C to A:T base pairs, but proofreading reduces the impact of these mutations, helping to maintain the genomic G:C content. These findings give an unprecedented view of how polymerase and error-correction pathways work together to maintain E. coli's low mutation rate of 1 per 1000 generations.


Subject(s)
DNA Replication , DNA, Bacterial/genetics , Escherichia coli/genetics , Whole Genome Sequencing/methods , DNA Damage , DNA Mismatch Repair , DNA Polymerase III/metabolism , Escherichia coli Proteins/metabolism , Mutation Rate , SOS Response, Genetics
15.
Int J Mol Sci ; 19(4)2018 Mar 30.
Article in English | MEDLINE | ID: mdl-29601491

ABSTRACT

Eucommia ulmoides (E. ulmoides), the sole species of Eucommiaceae with high importance of medicinal and industrial values, is a Tertiary relic plant that is endemic to China. However, the population genetics study of E. ulmoides lags far behind largely due to the scarcity of genomic data. In this study, one complete chloroplast (cp) genome of E. ulmoides was generated via the genome skimming approach and compared to another available E. ulmoides cp genome comprehensively at the genome scale. We found that the structure of the cp genome in E. ulmoides was highly consistent with genome size variation which might result from DNA repeat variations in the two E. ulmoides cp genomes. Heterogeneous sequence divergence patterns were revealed in different regions of the E. ulmoides cp genomes, with most (59 out of 75) of the detected SNPs (single nucleotide polymorphisms) located in the gene regions, whereas most (50 out of 80) of the indels (insertions/deletions) were distributed in the intergenic spacers. In addition, we also found that all the 40 putative coding-region-located SNPs were synonymous mutations. A total of 71 polymorphic cpDNA fragments were further identified, among which 20 loci were selected as potential molecular markers for subsequent population genetics studies of E. ulmoides. Moreover, eight polymorphic cpSSR loci were also developed. The sister relationship between E. ulmoides and Aucuba japonica in Garryales was also confirmed based on the cp phylogenomic analyses. Overall, this study will shed new light on the conservation genomics of this endangered plant in the future.


Subject(s)
Eucommiaceae/genetics , Genome, Chloroplast/genetics , DNA, Chloroplast/genetics , Evolution, Molecular , Genetics, Population , Mutation/genetics , Phylogeny
16.
J Cancer ; 8(2): 162-173, 2017.
Article in English | MEDLINE | ID: mdl-28243320

ABSTRACT

Background: To support cancer therapy, development of low cost library preparation techniques for targeted next generation sequencing (NGS) is needed. In this study we designed and tested a PCR-based library preparation panel with limited target area for sequencing the top 12 somatic mutation hot spots in colorectal cancer on the GS Junior instrument. Materials and Methods: A multiplex PCR panel was designed to amplify regions of mutation hot spots in 12 selected genes (APC, BRAF, CTNNB1, EGFR, FBXW7, KRAS, NRAS, MSH6, PIK3CA, SMAD2, SMAD4, TP53). Amplicons were sequenced on a GS Junior instrument using ligated and barcoded adaptors. Eight samples were sequenced in a single run. Colonic DNA samples (8 normal mucosa; 33 adenomas; 17 adenocarcinomas) as well as HT-29 and Caco-2 cell lines with known mutation profiles were analyzed. Variants found by the panel on APC, BRAF, KRAS and NRAS genes were validated by conventional sequencing. Results: In total, 34 kinds of mutations were detected including two novel mutations (FBXW7 c.1740:C>G and SMAD4 c.413C>G) that have not been recorded in mutation databases, and one potential germline mutation (APC). The most frequently mutated genes were APC, TP53 and KRAS with 30%, 15% and 21% frequencies in adenomas and 29%, 53% and 29% frequencies in carcinomas, respectively. In cell lines, all the expected mutations were detected except for one located in a homopolymer region. According to re-sequencing results sensitivity and specificity was 100% and 92% respectively. Conclusions: Our NGS-based screening panel denotes a promising step towards low cost colorectal cancer genotyping on the GS Junior instrument. Despite the relatively low coverage, we discovered two novel mutations and obtained mutation frequencies comparable to literature data. Additionally, as an advantage, this panel requires less template DNA than sequence capture colon cancer panels currently available for the GS Junior instrument.

17.
Hum Mutat ; 37(10): 1042-50, 2016 10.
Article in English | MEDLINE | ID: mdl-27363847

ABSTRACT

Cancer and developmental disorders (DDs) share dysregulated cellular processes such as proliferation and differentiation. There are well-known genes implicated in both in cancer and DDs. In this study, we aim to quantify this genetic connection using publicly available data. We found that among DD patients, germline damaging de novo variants are more enriched in cancer driver genes than non-drivers. We estimate that cancer driver genes comprise about a third of DD risk genes. Additionally, de novo likely-gene-disrupting variants are more enriched in tumor suppressors, and about 40% of implicated de novo damaging missense variants are located in cancer somatic mutation hotspots, indicating that many genes have a similar mode of action in cancer and DDs. Our results suggest that we can view tumors as natural laboratories for assessing the deleterious effects of mutations that are applicable to germline variants and identification of causal genes and variants in DDs.


Subject(s)
Genetic Predisposition to Disease , Neoplasms/genetics , Neurodevelopmental Disorders/genetics , Computational Biology/methods , Databases, Genetic , Germ-Line Mutation , Humans , Mutation, Missense
18.
Hum Mutat ; 37(9): 865-76, 2016 09.
Article in English | MEDLINE | ID: mdl-27328919

ABSTRACT

TP53 gene mutations are one of the most frequent somatic events in cancer. The IARC TP53 Database (http://p53.iarc.fr) is a popular resource that compiles occurrence and phenotype data on TP53 germline and somatic variations linked to human cancer. The deluge of data coming from cancer genomic studies generates new data on TP53 variations and attracts a growing number of database users for the interpretation of TP53 variants. Here, we present the current contents and functionalities of the IARC TP53 Database and perform a systematic analysis of TP53 somatic mutation data extracted from this database and from genomic data repositories. This analysis showed that IARC has more TP53 somatic mutation data than genomic repositories (29,000 vs. 4,000). However, the more complete screening achieved by genomic studies highlighted some overlooked facts about TP53 mutations, such as the presence of a significant number of mutations occurring outside the DNA-binding domain in specific cancer types. We also provide an update on TP53 inherited variants including the ones that should be considered as neutral frequent variations. We thus provide an update of current knowledge on TP53 variations in human cancer as well as inform users on the efficient use of the IARC TP53 Database.


Subject(s)
Databases, Genetic , Mutation , Neoplasms/genetics , Tumor Suppressor Protein p53/genetics , Data Curation , Genetic Predisposition to Disease , Genomics , Humans , Phenotype , Protein Domains , Software , Tumor Suppressor Protein p53/chemistry
19.
Genome Biol Evol ; 7(1): 262-71, 2014 Dec 23.
Article in English | MEDLINE | ID: mdl-25539726

ABSTRACT

High levels of genetic diversity exist among natural isolates of the bacterium Pseudomonas fluorescens, and are especially elevated around the replication terminus of the genome, where strain-specific genes are found. In an effort to understand the role of genetic variation in the evolution of Pseudomonas, we analyzed 31,106 base substitutions from 45 mutation accumulation lines of P. fluorescens ATCC948, naturally deficient for mismatch repair, yielding a base-substitution mutation rate of 2.34 × 10(-8) per site per generation (SE: 0.01 × 10(-8)) and a small-insertion-deletion mutation rate of 1.65 × 10(-9) per site per generation (SE: 0.03 × 10(-9)). We find that the spectrum of mutations in prophage regions, which often contain virulence factors and antibiotic resistance, is highly similar to that in the intergenic regions of the host genome. Our results show that the mutation rate varies around the chromosome, with the lowest mutation rate found near the origin of replication. Consistent with observations from other studies, we find that site-specific mutation rates are heavily influenced by the immediately flanking nucleotides, indicating that mutations are context dependent.


Subject(s)
DNA Mismatch Repair/genetics , Genetic Variation , Host-Pathogen Interactions/genetics , Mutation Rate , DNA Replication/genetics , Drug Resistance, Bacterial/genetics , Humans , INDEL Mutation/genetics , Pseudomonas fluorescens/genetics
20.
Genomics ; 103(5-6): 349-56, 2014.
Article in English | MEDLINE | ID: mdl-24727706

ABSTRACT

A major objective for evolutionary biology is to identify regions affected by positive selection. High dN/dS values for proteins and accelerated lineage-specific substitution rates for non-coding regions are considered classic signatures of positive selection. However, these could also be the result of non-adaptive phenomena, such as GC-biased gene conversion (gBGC), which favors the fixation of strong (C/G) over weak (A/T) nucleotides. Recent estimates indicate that gBGC affected up to 20% of regions with signatures of positive selection. Here we evaluate the impact of gBGC through its molecular signature of weak-to-strong mutational hotspots. We implemented specific modifications to the test proposed by Tang and Lewontin (1999) for identifying regions of differential variability and applied it to regions previously investigated for the influence of gBGC. While we found significant agreement with previous reports, our results suggest a smaller influence of gBGC than previously estimated, warranting further development of methods for its detection.


Subject(s)
Gene Conversion , Mutation Rate , Algorithms , Animals , Base Composition , Base Sequence , Computer Simulation , Consensus Sequence , DNA Mutational Analysis , Genome, Human , Humans , Models, Genetic , Molecular Sequence Data , Pituitary Adenylate Cyclase-Activating Polypeptide/genetics , Sequence Alignment
SELECTION OF CITATIONS
SEARCH DETAIL